Building Ontologies from Textual Resources: A Pattern Based Improvement Using Deep Linguistic Information
نویسندگان
چکیده
Ontologies are a key component for several applications. Ontologies are often built by hand, but automatizing the process of ontology building has been and is even more recognized as very important for scaling and speeding up this process. However, several difficulties have been identified, some of them are quite fundamental. In this paper, we present our work for overcoming some of the fundamental difficulties. Our work resulted in improvements of an existing ontology building tool (Text2Onto). The contribution of our work consists in the creation of a flexible language (DTPL—Dependency Tree Patterns Language) for expressing patterns as syntactic dependency trees to extract semantic relations, and making an existing ontology building tool (Text2Onto) able to use them. DTPL allows to exploit deep linguistic information (related to coreference resolutions, conjunctions, appositions, passive verbal phrases, etc.) provided by deep syntactic analysis of the text, and also (in order to improve the accuracy of patterns) to express the exclusion of some dependency bindings in patterns.
منابع مشابه
Multilingual Ontologies and English- Bulgarian Ontology Development
In this paper we make a short survey of the approaches for development of multilingual ontologies. Our main goal is to find appropriate approach for development of multilingual ontologies, including Bulgarian language terminology. We propose a collaborative methodology for development of English-Bulgarian bilingual ontologies by usage of information extraction from e-learning textual content, l...
متن کاملTowards a Standardized Linguistic Annotation of the Textual Content of Labels in Knowledge Representation Systems
We propose applying standardized linguistic annotation to terms included in labels of knowledge representation schemes (taxonomies or ontologies), hypothesizing that this would help improving ontology-based semantic annotation of texts. We share the view that currently used methods for including lexical and terminological information in such hierarchical networks of concepts are not satisfactor...
متن کاملTowards Cross-Media Feature Extraction
In this paper we describe past and present work dealing with the use of textual resources, out of which semantic information can be extracted in order to provide for semantic annotation and indexing of associated image or video material. Since the emergence of semantic web technologies and resources, entities, relations and events extracted from textual resources by means of Information Extract...
متن کاملArabOnto: experimenting a new distributional approach for building Arabic ontological resources
Ontologies are useful for modelling and retrieving knowledge in complex information systems. Ontology construction environments use statistical and linguistic information to extract knowledge from corpora. Within the great improvement in this field, there is a need to introduce the Arabic language in these environments. We present the ArabOnto architecture modelling the process of Arabic ontolo...
متن کاملMethodology to Build Medical Ontology from Textual Resources
In the medical field, it is now established that the maintenance of unambiguous thesauri goes through ontologies. Our research task is to help pneumologists code acts and diagnoses with a software that represents medical knowledge through a domain ontology. In this paper, we describe our general methodology aimed at knowledge engineers in order to build various types of medical ontologies based...
متن کامل